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Abstract. We introduce and study three different notions of tropical rank 
for symmetric and dissimilarity matrices in terms of minimal decompositions 
into rank 1 symmetric matrices, star tree matrices, and tree matrices. Our 
results provide a close study of the tropical secant sets of certain nice tropical 
varieties, including the tropical Grassmannian. In particular, we determine 
the dimension of each secant set, the convex hull of the variety, and in most 
cases, the smallest secant set which is equal to the convex hull. 



1. Introduction 

In this paper, we study tropical secant sets and rank for symmetric matrices. 
Our setting is the tropical semiring (K. U {cx)},®,©), where tropical addition is 
given by X ® y = min(x, y) and tropical multiplication is given by x(z)y = x + y. 
The kth tropical secant set of a subset V of M.^ is defined to be the set of points 
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where © denotes coordinate-wise minimum. This set is typically not a tropical 
variety and thus we prefer the term "secant set" to "secant variety," which has 
appeared previously in the literature. The rank of a point x 6 with respect 
to V is the smallest integer k such that x lies in the fcth tropical secant set of V , 
or 00 if there is no such k. 

In [7], Develin, Santos, and Sturmfels define the Barvinok rank of a matrix, not 
necessarily symmetric, to be the rank with respect to the subset of n x n rank 1 
matrices, and their definition serves as a model for ours. In addition, they define 
two other notions of rank, Kapranov rank and tropical rank, for which there arc no 
analogues in this paper. Further examination of ranks of not necessarily symmetric 
matrices can be found in the review article [1]. 

We give a careful examination of secant sets and rank with respect to three 
families of tropical varieties in the space of symmetric matrices and the space of 
dissimilarity matrices. By a n x n dissimilarity matrix we simply mean a function 
from ('2') to ]R, which we will write as a symmetric matrix without any entries on 
the diagonal. There is a natural projection from n x n symmetric matrices to n x n 
dissimilarity matrices which we denote by vr. For example, 
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and 7r(A/) 
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are a symmetric matrix and dissimilarity matrix respectively. 
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Figure 1. Weighted tree whose distance matrix is 7r(A/) from dT]). 

Our first family is the tropical Veronese of degree 2, which is the tropicahzation of 
the classical space of symmetric matrices of rank 1 . It is a classical linear subspace 
of the space of symmetric matrices consisting of those matrices which can be written 
a,s v'^ Q V for some row vector v. The rank of a matrix with respect to the tropical 
Veronese is called symmetric Barvinok rank, because it is the symmetric analogue 
of Barvinok rank. 

Second, we consider the space of star trees, which is the image of the tropical 
Veronese under the projection tt. Equivalently, it can be obtained by first project- 
ing the classical Veronese onto its off-diagonal entries and then tropicalizing. The 
classical variety and its secant varieties were studied in . The tropical variety is a 
classical linear subspace of the space of dissimilarity matrices, and we call the corre- 
sponding notion of rank star tree rank. The name reflects the fact that the matrices 
with star tree rank 1 are precisely those points of the tropical Grassmannian which 
correspond to trees with no internal edges, i.e. star trees, in in the identification 
below. 

Third, we consider the tropical Grassmannian G2,m which is the tropicahzation 
of the Grassmannian of 2-dimensional subspaces in an n-dimensional vector space, 
and was first studied in |14| . It consists of exactly those dissimilarity matrices 
arising as the distance matrix of a weighted tree with n leaves in which internal edges 
have negative weights. Therefore, we call the points in the tropical Grassmannian 
tree matrices, and call rank with respect to the tropical Grassmannian the tree rank. 
Note that our definition of tree rank differs from that in [12l Gh. 3], which uses a 
different notion of mixtures. 

Our first two families are examples of classical linear spaces, whose secant sets 
were studied by Mike Develin [6] . He defines a natural polyhedral fan such that the 
tropical secant set is the support of this polyhedral fan. Moreover, Theorem 2.1 
in [6] gives an algorithm for computing the rank of a point with respect to a fixed 
linear space. In contrast, we do not know of a good algorithm for computing the 
rank with respect to the tropical Grassmannian (sec Scction[5]), and we do not know 
of a natural fan structure. 

We use our examples of M and 7r(M) from ([T]) to illustrate our three notions of 
rank. Proposition [5] tells us that the symmetric Barvinok rank of M is 4. Theo- 
rem [10] tells us that the star tree rank of Tr{M) is 2. Explicitly, we have 
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Finally, the tree rank of tt{M) is 1 by Proposition [151 which can also be seen 
explicitly from the weighted tree in Figure [TJ This example shows that all three of 
our notions of rank can be different. 

However, for any n x n symmetric matrix Af , we have 

(2) symmetric Barvinok rank(M) > star tree rank(7r(M)) > tree rank(7r(M)). 

The first inequality follows from the fact that the set of dissimilarity matrices of star 
tree rank 1 is the projection of the set of matrices of symmetric Barvinok rank 1. 
The second inequality follows from the fact that the space of star trees is contained 
in the tropical Grassmannian. 

The rest of the paper is organized as follows. In Section[2l we present a technique 
for proving lower bounds on rank. We introduce a graph associated to a matrix for 
each of our notions of rank; the chromatic number of this graph is a lower bound 
on the rank of the matrix. The same technique applies to provide a lower bound to 
the rank of any point with respect to any tropical prcvaricty, although in general 
it may produce a hypergraph instead of a graph. 

We examine symmetric Barvinok rank, star tree rank, and tree rank in Sections 
[31 m and [5] respectively. We prove upper bounds on the rank in each case, and 
with the exception of tree rank, our upper bounds arc sharp. We show that the 
symmetric Barvinok rank of an n x n symmetric matrix can be infinite, but even 
when the rank is finite it can exceed n, and in fact can grow quadratically in n 
(Theorem [6]). For each notion of rank, the set of matrices with rank at most k is 
a union of polyhedral cones, and we compute the dimension of these sets, defined 
as the dimension of the largest cone. In each case, the dimension of the tropical 
secant set equals the dimension of the clasical secant variety, confirming Draisma's 
observation that tropical geometry provides useful lower bounds for the dimensions 
of classical secant varieties [8] . Finally, we give a combinatorial characterization of 
each notion of rank for a 0/1 matrix in terms of graph covers. 

In Section [6l we examine 3x3 symmetric matrices and explicitly characterize 
the stratification by symmetric Barvinok rank. In Sections [7] and [8l we do the same 
for the 5x5 dissimilarity matrices and the stratifications by star tree rank and tree 
rank respectively. In particular, we show that the lower bounds from the chromatic 
number in Section [2] are exact in these cases. We close with some open problems 
in Section [9] 

2. Lower bounds on rank via hypergraph coloring 

Before we examine our three notions of rank, we give a general combinatorial 
construction: a hypergraph whose chromatic number yields a lower bound on rank. 

Recall that a hypergraph consists of a ground set, called vertices, and a set of 
subsets of the ground set, called hyperedges. The chromatic number of a hyper- 
graph H, denoted x{H), is the smallest number r such that the vertices of H can 
be partitioned into r color classes with no hyperedge of H monochromatic. In 
particular, if H contains a hyperedge of size 1, then x{H) is oo. 

Now, suppose we have a tropical prevariety V C R^. Recall that a tropical 
polynomial 

t 

(3) p{xi, . . . , Iat) = xj" • • • x^J" 

i=l 
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defines a tropical hypersurface consisting of those vectors x G R sucli that the 
minimum in evaluating p{x) is achieved at least twice. A tropical prevariety is the 
intersection of finitely many tropical hypersurfaces, and any finite set S of tropical 
polynomials defining the prevariety V is called a tropical basis. 

Now, given a point w G M.^ and a tropical basis S for V, we construct a hyper- 
graph on ground set [N] as follows. Let p from ([3]) be a tropical polynomial in S", 
with all exponents > 0. If the minimum is achieved uniquely when p is evaluated 
at w, then we add a hyperedge E whose elements correspond to the coordinates 
that appear with non-zero exponent in the unique minimal term. The deficiency 
hypergraph of w with respect to V and S consists of hyperedges coming from all 
polynomials in S with a unique minimum at w. In particular, the deficiency hy- 
pergraph has no hyperedges (and thus has chromatic number 1) if and only if w is 
in V. 

Proposition 1. If H is the deficiency hypergraph constructed above, then the rank 
of w G M.^ with respect to V M.^ is at least x{H)- 

Proof. Suppose that w has rank r with respect to V , and let ui = ui © • ■ • © be 
an expression of w as the tropical sum of r points Vi G V . We will construct an r- 
coloring of the deficiency hypegraph iJ , which will show that x{H) < r. For each i G 
[N] , there is at least one Vj which agrees with w in the ith coordinate, so arbitrarily 
pick one such j as the color for vertex i in H. Let E he a hyperedge of H and 
p the associated tropical polynomial. We claim that E cannot be monochromatic 
with color j. Each coordinate of Vj is greater than or equal to the corresponding 
coordinate of w, so each term of p{vj) is greater than or equal to the corresponding 
term of p^w) . On the other hand, the minimum in the evaluation of p{vj ) is achieved 
at least twice, so the minimum must be strictly greater than p{w). Thus, Vj cannot 
agree with w for all coordinates in E, so E is not monochromatic of color j. This 
holds for any color j, so we have constructed an r-coloring, and thus, x{H) < r. □ 

Corollary 2. // the deficiency hypergraph H has a hyperedge of size 1, then the 
rank of w with respect to V is infinite. 

Proof. If H has a hyperedge of size 1, then x{H) is oo, and thus the rank of w is 
infinite. □ 

We do not know of any examples in which this lower bound is actually strict; 
see Section |9l 

For the varieties considered in this paper, we will take quadratic tropical bases 
and thus the deficiency hypergraph will always be a graph (possibly with loops). 
Accordingly, we will call it the deficiency graph. 

3. Symmetric Barvinok rank 

Recall from the introduction that the symmetric Barvinok rank of a symmetric 
matrix M is the smallest number r such that M can be written as the sum of r 
rank 1 symmetric matrices. The 2x2 minors XijXki © xaXkj of M for i ^ k and 
I 7^ j form a tropical basis for the variety of rank 1 symmetric matrices. We will 
always construct our deficiency graph with respect to this tropical basis. 

Our first observation is that the symmetric Barvinok rank of a matrix can be 
infinite. More precisely. 
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Proposition 3. // M is a symmetric matrix and 2Mij < Mu + Mjj for some i 
and j , then the symmetric Barvinok rank of M is infinite. 

Proof. The tropical polynomial x'j^j © XiiXjj is in the tropical basis, so if 2Mij < 
Mii + Mjj for some i and j, then the deficiency graph for M has a loop at the node 
ij. Therefore, M has infinite rank by Corollary [21 □ 

In fact, the converse to Proposition [3] is also true; see Theorem |6l In order to 
construct decompositions into rank 1 matrices, we need the following lemma. 

Lemma 4. Let M be an m x m symmetric Barvinok rank 1 matrix, n > m an 
integer and C any real number. Then there exists an n x n symmetric rank 1 
matrix N such that the upper left m x m suhmatrix is M and every other entry is 
at least C . 

Proof. Since M has rank 1, then AI = v'^ Q v for some row vector v. Let C" ~ 
max{iC, C — Vi}, and let w be the vector consisting of v followed by C repeated 
n — m times. Then N = vo^ w has the desired properties. □ 

Remark 1. We will use the symbol oo in an entire row and column of a matrix to 
denote sufficiently large values that maintain the property of being rank 1. So, if 
M is an m X m rank 1 matrix, then 

"M oo 

oo oo 

denotes the (m+l) x (m + 1) matrix obtained by applying Lemma[4]with n — m+1. 
The value of C will be clear from the context. 

Next, we give a graph-theoretic characterization of the symmetric Barvinok rank 
of 0/1-matrices. We define a clique cover of a simple graph C? to be a collection 
of r complete subgraphs such that every edge and every vertex of G is in some 
element of the collection. Given an n x n symmetric 0/1 matrix M with zeroes on 
the diagonal, define Gm to be the graph whose vertices are the integers [n], and 
which has an edge between i and j if and only if My = 0. 

Proposition 5. Suppose M is a symmetric 0/1 matrix with zeroes on the diagonal. 
Then the symmetric Barvinok rank of M is the size of a smallest clique cover ofGm- 
On the other hand, suppose that M is a symmetric 0/1 matrix with at least one 
entry of 1 on the diagonal. If there exist i and j such that Ma — 1 and Mij — 0, then 
the symmetric Barvinok rank of M is infinite. Otherwise, let M' be the maximal 
principal submatrix with zeroes on the diagonal. The symmetric Barvinok rank of 
M is one greater than the symmetric Barvionk rank of M' . 

Proof. First suppose that all diagonal entries of M are 0. Let Gi, . . . , be a clique 
cover of Gm- Let Vi be the 0/1 row vector whose jth entry is if j is a vertex 
of Gi, and let Mi = vf Vi. Then we claim that M = M^. Each entry in M 
corresponds to an edge or vertex of Gm which is in some Gi, so there is a in Mj. 
For j ^ k such that Mjk = 1, some Gi contains the vertex j, and hence misses the 
vertex k, so {Mi)jk = 1. On the other hand, the entry is at least 1 in the other 
rank 1 matrices because none of the corresponding graphs contains jk as an edge. 
Thus the rank of M is at most r. 

Conversely, suppose A'^ is a rank 1 symmetric matrix in a decomposition of M. 
Since all entries of M are non-negative, so are the entries of N, so the rank 1 
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condition says Nn ~ Njj = if and only if Nij = 0. By the "if" direction, we 
can define a graph Gn whose vertices and edges correspond to the diagonal and 
off-diagonal zeroes of N respectively. By the "only if" direction, this is a complete 
graph. Every position with a in M must be for some rank f matrix in the 
decomposition, so the graphs Gn form a clique cover of Gm as ranges over all 
rank 1 symmetric matrices in the decomposition. Thus, the rank of M is exactly r. 

Next, we suppose M has at least one 1 on the diagonal and let M' C M be as in 
the statement. If there exist i and j such that Mu ~ 1 but A/y = 0, then the rank 
of Af is infinite by Proposition [31 Otherwise, we claim that the rank of Af is r + 1. 
Extending each rank 1 summand of a minimal decomposition of Af ' by Lemma |4] 
and adding in the all ones matrix shows that the rank of M is at most r + 1. On 
the other hand, it is straightforward to check that a rank 1 summand containing a 
1 entry on the diagonal can contain no zeroes, so docs not contribute to a clique 
cover for Gm' . So the rank of M is exactly r + 1 in this case. □ 

Remark 2. This characterization gives us two families of matrices which have rank n 
and [n^/4j respectively, namely those corresponding to the trivial graph with n 
isolated vertices and the complete bipartite graph K\^n/2\,\n/2]- In the latter case, 
^[n/2j.[n/2] is triangle- free, so no clique can consist of more than one edge. On 
the other hand, there are [n^/4j edges in the graph, so [n^/4j cliques are needed 
in a cover. In fact, these two examples have the maximum possible rank for n x n 
matrices, as shown below. 

Theorem 6. Suppose that M is a symmetric n x n matrix with Mu + Mjj < 2Mij 
for all i and j . Then the symmetric Barvinok rank of M is at most max{n, ['^^/4J}, 
and this bound is tight. Thus, every matrix with finite rank has rank at most 
max{n, [n^/4j}. 

Proof. Subtracting Mu/2 from the ith row and column for all i does not change 
the rank, so we can assume that the diagonal entries of M are and hence, by 
hypothesis, the off-diagonal entries are non-negative. 
The statement is trivial for n ~ I. For n ~ 2, we have 
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For n = 3, we assume, without loss of generality, that A/12 > A/23. Then 
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where "00" is as in Remark [TJ For n at least 4, the proof is by induction, using the 
following lemma: 

Lemma 7. Let ji > 4. Suppose that for any (n — 2) x (n — 2) matrix N of finite 
rank, there exists a matrix N' of rank at most [(n — 2)^/4j, such that N' is identical 
to N except possibly in one diagonal entry, where the entry of N' is greater than 
or equal to the entry of N . Then any nxn matrix has rank at most [n^ /A\. 

Remark 3. The exceptional diagonal entry in the hypothesis makes the statement 
of the lemma slightly stronger than what is required for an inductive proof of an 
[n^/4j upper bound. However, we will use the lemma to establish the base cases 
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of the induction, n = 4 and n = 5, which require a weaker hypothesis because for 
n = 2 and n = 3 the [n^/4j upper bound on rank does not hold. 

Proof of Lemma^ Let M be an n x n matrix with finite rank. Without loss of 
generality, assume that the entry M12 is minimal among all off-diagonal elements. 
We apply the hypothesis to the principal submatrix indexed by {3, . . . , n}. Applying 
Lemma 01 we have a collection of at most [(n — 2)^/4j rank 1 matrices whose 
tropical sum agrees with M except for the first two rows and columns and possibly 
one diagonal entry, which, without loss of generality, we assume to be M44. For 
each 4 < i < n, take the rank 1 matrix which has arbitrary large values except for 
the {1,2, j} principal submatrix, which is: 

2Mu Mu + M2^ Mu 

Mii+M2i 2M2^ M2^ . 

Mu M2^ _ 

Note that since M12 was chosen to be minimal, we have that M12 < Mu + M2i- 

Finally, switching indices 1 and 2 if necessary, we can assume that M13 > M23, 
and hence Afi2 + M13 > M23. Then, we take two matrices which are "00" outside 
of the {1,2,3} principal matrices, which are, respectively, 
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recalling the meaning of "cxd" from Remark [TJ 

This yields a decomposition of M into at most [(n — 2)^/4j + (n — 3) + 2 = [n^/4j 
symmetric rank 1 matrices. □ 



To complete the proof of the Theorem[6l we note that taking all but the last term 
of (jll and ([5]) gives the hypothesis to Lemma [7] for n = 4 and 5. The desired upper 
bound follows by induction, which consists of applying Lemma [7] with A^' — N. 
Finally, Remark [2] shows that this bound is tight. □ 

Theorem 8. The dimension of the space of symmetric nxn matrices of symmetric 
Barvinok rank at most r is ("J^) — (" 9^^), which is the dimension of the classical 
secant variety, i.e. the space of classical symmetric matrices of classical rank at 
most r. 

Proof. Let D = ("J^) — (" 2^^)- The tropical secant variety is contained in the 
tropicalization of the classical secant variety, so the dimension is at most D, by 
the Bieri-Grovcs Theorem [2j Thm. A]. Thus, it is sufficient to find an open neigh- 
borhood in which the tropical variety has dimension D. For i from 1 to r, let 
Vi = (C, . . . , C, Vi,i, . . . , Vi,n) be a vector with C for the first i — 1 entries. Choose 
the coordinates fi+i.j to be smaller than all the Vi,j and let C be very large. Then, 

2vii fii4-i;i2 •■■ I'll + wi„ 

Vii + V12 2V22 ■■■ V22 + V2n 

Vll + Vin V22 +V2n ■■■ 2Vrn 



© Ul © • ■ • ® = 
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This matrix is an injective function of the vector entries Vij for i < r and j > i. 
Thus, it defines a neighborhood of the rth secant set of the desired dimension 

n + (71 - 1) + • • • + (n - r + f ) = 2 j - ^ j = D. □ 

4. Star tree rank 

Recall from the introduction that a star tree matrix is one which can be written 
as Tr{v'^ v) for € M" a row vector. The star tree matrices from a classical linear 
space in the space of n x n dissimilarity matrices defined by the tropical polynomials 

(6) XijXki © XikXji for i, j, k, and I distinct integers. 

In this section, the deficiency graph will always be taken with respect to this tropical 
basis. The following lemma is an immediate consequence of Lemma 31 

Lemma 9. Let M be an m x m star tree matrix, and let n > m be any integer 
and C any real number. Then there exists an n x n star tree matrix with M as the 
upper left m x m submatrix and all other entries greater than C . 

Unlike the case of symmetric Barvinok rank, the star tree rank is always finite. 

Theorem 10. For n at least 3, the star tree rank of a nxn dissimilarity matrix M 
is at most n ~ 2, and this bound is sharp. In particular, the dissimilarity matrix 
defined by Mij ~ min{i, j} has star tree rank n — 2. 

Proof. The proof of the upper bound is by induction on n. For n = 3, the equations 
in ^ are trivial, and thus every 3x3 dissimilarity matrix is a star tree matrix. 
For n > 3, let M be an n x n dissimilarity matrix, and denote by M' the upper left 
(n — 1) X (n — 1) submatrix. By the inductive hypothesis, we can write M' as the 
tropical sum of n — 3 star tree matrices. We can extend each of these to ti x n star 
tree matrices by Lemma El and their tropical sum will agree with M except in the 
last column and row. Let w be the vector defined by Wi = Mi„ + C for i < n and 
Wn = —C, for C a sufficiently large number. Then the tropical sum of the previous 
n — 3 matrices together with 7r(w^ w) equals M. 

Now let M be the dissimilarity matrix defined by Mij = min{i, j} as in the 
statement. We claim that the deficiency graph of M has chromatic number n — 2. 
For every i < j < k < I, we have 

(7) M,k + Mji = Ma + Mjk = i + .] < M,j + M^i =i + k, 

so the deficiency graph has an edge between ik and j7, and an edge between il and 
jk. We refer to these types of edges as "overlapping" and "nesting" respectively. 

We prove that the deficiency graph of M has chromatic number at least n — 2 
by induction on n. The case of n = 3 is clear. Let n be greater than 3 and fix a 
coloring of the deficiency graph of M. Let S be the set of nodes of the same color c 
as the node In. There is a "nesting" edge between In and every node other than 
those of the form li or in for some i. Thus, every node in S is either of the form li 
or in. Furthermore, because of the "overlapping" edges, there must be an integer 
m such that if li is in 5, then i < m and if jn is in 5, then m < j. Now consider 
the set of n — 3 nodes consisting of li for m < i < n and jn for 1 < j < m. By 
our construction of m, none of them is in S. Therefore, if they have distinct colors, 
then the coloring has at least n — 2 colors, which is what we wanted to show. 
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Otherwise, two of these nodes have the same color, and by the "overlapping" 
edges and symmetry we can assume that they are li and li', with m < i < i' , 
which have color c' ^ c. Any node jk with j < k and 1 < j < i will share an edge 
with one of these two nodes, so cannot have color c'. On the other hand, if the 
node II with / < m is adjacent to jk with j < k, then we must have I < j < I. But 
I < m < i, so jk cannot have color c' . Thus, we can assume that all of the nodes II 
with I < m also have color c' without changing the fact that the coloring is proper. 
With this change, the only nodes with color c are of the form jn. By restricting to 
the nodes with coordinates less than n, we have a coloring of the deficiency graph 
of the {n — 1) X [n — 1) matrix without the color c, so by the inductive hypothesis 
we're done. □ 

Remark 4. The matrix with maximal star tree rank in the previous theorem is in 
fact in the Grassmannian, i.e. it has tree rank 1. Indeed, from ([7]), we see that 
the four-point condition holds. Alternatively, M arises as the distance matrix of 
the following weighted tree. Let T be the caterpillar tree with n internal vertices, 
connected in order by edges of weight —1/2. The ith leaf vertex is connected to 
the ith internal vertex by an edge of weight i/2. For i < j, the distance from leaf i 
to leaf j is i/2+(j — — l/2)+j/2 = i, which is equal to the corresponding entry 
in the matrix AI from Theorem 1101 In order to make a proper phylogenetic tree, 
we should remove the first and last internal vertices and combine the adjacent edge 
weights. 

Next, we give a graph theoretic characterization of the star tree rank of 0/1- 
matrices. For M a 0/1 dissimilarity matrix, we define Gm to be the graph whose 
edges correspond to the zeroes of M. As in the case of symmetric Barvinok rank, 
we can characterize the star tree rank of M in terms of covers oi Gm, this time by 
both cliques and star trees. We will also say that a cover of Gm by cliques and star 
trees is a solid cover if for every pair of distinct vertices i and j either: 

(1) there is an edge between i and j, 

(2) either i or j belongs to a clique in the cover, 

(3) either i or j is the center of a star tree in the cover, or 

(4) both i and j are leaves of the same star tree. 

Proposition 11. Let M be a 0/1 dissimilarity matrix. Let r be the minimal number 
of graphs in a cover of Gm by cliques and star trees, such that every edge (but not 
necessarily every vertex) is in some element of the cover. Then M has star tree 
rank either r or r + 1. 

Moreover, if Gm has a solid cover by r graphs, then M has star tree rank r. 

Proof. Let Gi, . . . , G,. be a cover of Gm by chques and star trees. For Gi a clique, 
define Vi to be the 0/1 vector whose entries correspond to the vertices of the 
clique. If Gi is a star tree consisting of a central vertex Ci and edges to vertices 
in the set Li, then define Vi to be the row vector which is —1/2 in the Ci entry, 
1/2 for the entries corresponding to Li and 3/2 otherwise. In either case, define 
ALi = ^{vj Q Vi). Then Mi has entries corresponding to the edges of Gi. Thus, 
the tropical sum of the Mi has the same entries as M. Moreover, if the cover 
is a solid cover then the tropical sum is equal to M, so M has rank at most r. 
Otherwise, some of the positive entries of the tropical sum are greater than 1. By 
additionally taking the tropical sum with the all ones matrix, we see that M has 
rank at most r + 1. 
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Conversely, suppose that Mi = 7t{v'[ Q Vi) is a term in a representation of M 
as the tropical sum of star tree matrices. Then we claim that the zeroes of Mi 
correspond to cither a star tree or a complete graph. If all entries of Vi are non- 
negative, then the zeroes of Mi correspond to the complete graph on the vertices 
where Vi is 0. Otherwise, since Mi must be non-negative, there can be at most 
one negative entry of Vi, say with value —a; then all other entries must be at least 
a. Then the entries of Mi correspond to the star tree with edges between the 
entry with —a and the entries with value a. Thus, any decomposition of M as the 
tropical sum of star tree metrics yields a cover of Gm by cliques and star trees, so 
M has tropical rank at least r. □ 

We do not know if the definition of a solid cover can be weakened in any way. 
In other words, we do not know of any 0/1 matrices M such that r is the minimal 
size of a cover of Gm by cliques and star trees, and Gm does not have a solid cover 
of size r, but M has star tree rank r. 

In contrast to symmetric Barvinok rank, the upper bound of rt — 2 on the star 
tree rank of an n x n dissimilarity matrix cannot be achieved by a 0/1 matrix for 
large n. Recall that the Ramsey number R{k, k) is the smallest integer such that 
any graph on at least i?(fc, k) vertices has either a clique or a independent set of 
size k. Then we have the following stronger bound on the star tree rank of a 0/1 
matrix. 

Proposition 12. For n > Rik, k), any nxn 0/1 dissimilarity matrix has star tree 
rank at most n — k + 1. 

Proof. By the assumption on n, the graph Gm has either a clique of size k or an 
independent set of size k. In the former case, we can cover Gm by a star tree 
centered at each vertex not part of the clique, together with the clique itself. This 
gives a solid cover by n — fc -f 1 subgraphs. In the latter case, we can just take the 
star trees centered at the vertices not in the independent set, giving a cover of Gm 
by n — fc subgraphs. In either case, Proposition [Tl] shows that M has rank at most 
n-k+l. □ 

Corollary 13. For n > 18, every nxn 0/1 dissimilarity matrix has star tree rank 
at most n — 3. 

Proof. The Ramsey number i?(4, 4) is 18 [B]. □ 

Theorem 14. Let r and n be positive integers. Then the dimension of the space 
of dissimilarity nxn matrices of star tree rank at most r is 



Proof. Let D = min{("J^) — (" 2^^), (2)} be the dimension from the theorem 
statement. The dimension cannot be any larger than D by the Bieri-Groves Theo- 
rem [21 Thm. A] , because D is the dimension of the classical secant variety, according 
to Theorem 2 in [S]. Therefore, it is sufficient to construct a matrix with star tree 
rank r which has a D-dimensional neighborhood of star tree rank r matrices. If 
r > n, then D is (2), the dimension of the set of ?i x n dissimilarity matrices. Since 
higher secant sets cannot have smaller dimension, it is sufficient to assume r < n. 

We will construct r vectors vi, . . . ,Vr, with Vk,i denoting the zth entry of Vk, and 
then define M to be the tropical sum of the star tree matrices n{v'[ Vk). First, 
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we fix any order on the set of pairs of distinct integers S ~ {(hj) '■ r < i < j < n}. 
Then, for 1 < fc < r and r < i < n, we choose Vks as follows: if the fcth pair of 
integers includes i, then we choose Vk.i in the range < Vk.i < 1 and otherwise we 
choose Vk.i > 2. For i in the range k < i < r, we choose Vk.i inductively beginning 
with k ^ r. We choose u^.r arbitrarily. Then for k < r, and k < i < wc choose 
t^ka to be much greater than any of the Vk+ij already chosen. Finally, let C be a 
large real number and set v^j equal to C, for i < k. Let M be the tropical sum of 
the Mfc := tt{v'^ Qvk)- 

We claim that the set of matrices which can be gotten in this way forms a D- 
dimensional affine linear neighborhood. For i < r and i < j, the entry of M 
comes from Mi and in particular, is equal to Vij + Vij. For a fixed i, and taking 
j > i, these entries give us n — i linearly independent functions on the matrix M. 
Moreover, if is the fcth pair in the ordering on S, and k < r, then the 
entry of M comes from Mk and is equal to Vk.i+Vkj. These are linearly independent 
from each other and from all of the previous functions. Since the size of S is ("^'^), 
the number of linearly independent functions on the matrix M is 



(n - 1) + (n - 2) H h (n - r)+ min 



n — r 
2 



n\ n — r\ , n ~ r 



, ,- mm < r, , 
21 \ 2 J I V 2 

+ 1^ ^71 - r + 1^ 

which is the desired dimension, D. □ 

Remark 5. In fact, the difficult part of Theorem 2 in [9] is proving the lower bound 
on the dimension of the classical secant variety. Our computation of the dimension 
of the tropical secant variety provides an alternative proof of this lower bound. 



5. Tree rank 

The tropical Grasmmannian 6*2, « is the tropical variety defined by the 3-term 
Pliicker relations: 

(8) XijXki © XikXji ® XiiXjk for al\ i < j < k < i. 

This condition is equivalent to coming from the distances along a weighted tree 
which has negative weights along the internal edges [lH Sec. 4]. In this section, 
we will always take the deficiency graph to be with respect to the Pliicker relations 
in (HI). 

As with the previous notions of rank, the tree rank of a 0/1 matrix can be 
characterized in terms of covers of graphs. For any disjoint subsets Ii, . . . ,Ik C [n] 
(not necessarily a partition), the complete k-partite graph is the graph which has 
an edge between the elements of Ii and Ij for all i ^ j. Complete fc-partite graphs 
are characterized by the property that among vertices which are incident to some 
edge, the relation of having a non-edge is a transitive relation. 

Remark 6. The complete fc-partite graphs defined above are exactly those graphs 
whose edge set forms the set of bases of a rank 2 matroid on n elements. The transi- 
tivity of being a non-edge is equivalent to the basis exchange axiom. Alternatively, 
each of the sets /i , . . . , partition the set of non- loops in the matroid into parallel 
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classes. See [TT| for definitions of these terms. In the following proposition, we will 
see that the Pliicker relations imply the basis exchange axiom for the entries of a 
non-negative tree matrix. 

Proposition 15. Let M be an n X n 0/1 dissimilarity matrix and let r be smallest 
size of a cover of Gm by complete k-partite subgraphs. As in Provosition \lll we 
only require every edge to be in the cover, not necessarily every vertex. If Gm has 
at most one isolated vertex then M has tree rank r. Otherwise, M has tree rank 
r + 1. 

Proof. Let /i,...,/^ be disjoint sets defining a fc-partite graph. We construct a 
tree which has fc + 1 internal vertices: one vertex Vi for each i G [k] and a vertex 
w. Every element of li has a branch of length 1/2 to Vi and each Vi has a branch 
of length —1/2 to w. The elements of J = [n] \ (/i U • • • U Ik) are connected to 
w by a branch of length 1. This gives a distance matrix whose entries are for 
the edges of the fc-partite graph, 2 for the entries between elements of J, and 1 
elsewhere. If Gm has at most one isolated vertex, then the sum of these matrices 
is M. Otherwise, adding the matrix with all entries entries equal to 1 yields M. 

Conversely, suppose we have a decomposition of M as the sum of tree matrices. 
For each tree matrix, we can define a graph on the vertices [n] with edges corre- 
sponding to the entries. These form a cover of the graph of M, so we just need 
to show that the graph Gt coming from a tree matrix will be a complete fc-partite 
graph. For this, we need to show that the relation of having a non-edge is a tran- 
sitive relation among vertices which are incident to some edge. Suppose that {i,j) 
and (j, fc) are non-edges, but {i, fc) is an edge. Also suppose that j has an edge to 
some other vertex £. Then Mik + Mjg = 0, but My and Mjk are positive and M^g 
and Mig are non-negative, which contradicts the Pliicker relation. Thus, Gt is a 
complete fc-partite graph, and we have proved the theorem when Gm has at most 
one isolated vertex. 

Now suppose that Gm has at least 2 isolated vertices, i and j . There must be 
some tree matrix T in the decomposition of Gm such that Tij = 1. Suppose that 
the corresponding graph Gt has an edge between two vertices fc and £, which must 
be distinct from i and j by assumption. Since i and j are isolated, T^fc, Ta, Tjk, 
and Tjg must each be at least 1. But Tij + Tki = 1, which contradicts the Pliicker 
relation. Thus, Gt must be the trivial graph, so the decomposition of M must have 
one more term than a minimal cover of Gm. Therefore, M has rank r + 1. □ 

Note that by taking the /; in the definition of fc-partite graph to be singletons, 
we get complete graphs, and by taking fc = 2 with Ii a singleton and I2 any set 
disjoint from /i, we get star trees. Together with Propositions 1111 and fT5l this 
confirms, for 0/1-matrices, the second inequality in ([2]). 

Lemma 16. Let M be an m x m tree matrix, n > m an integer, and G any real 
number. Then there exists annxn tree matrix N , whose upper left mxm submatrix 
is AI and such that the other entries are each at least C . 

Proof. The matrix M encodes the distances on some weighted tree T on m leaves. 
Pick any internal vertex w of T and let C" be the smallest distance between v and 
a leaf of m. Let T' be the tree on n leaves formed from T by attaching each leaf 
i with m < i < n to f by an edge with weight maxjiC, G — C"}. Let TV be the 
distance matrix of T' and is a tree matrix with the desired properties. □ 
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Proposition 17. The dimension of the set of dissimilarity n x n matrices of tree 
rank at most r is the dimension of the classical secant variety, 




Proof. The tropical secant variety is contained in the tropicalization of the classical 
variety, which has the given dimension by [4l Thm. 2.1i]. Therefore, it is sufficient 
to prove that the tropical secant variety has at least the given dimension by the 
Bieri-Groves Theorem [21 Thm. A]. 

To prove the lower bound on the dimension, first note that for r = [?i/2j , n — 2r 
is cither or 1, so the first part of the statement implies that the dimension of the 
rth secant set is (2) , the dimension of the space of dissimilarity n x n matrices. 
Since higher secant varieties are at least as large as the precccding ones, this implies 
the second part of the statement. 

The proof of the first part, when r < n/2, is by induction on n. For n < 3 all 
dissimilarity matrices have tree rank 1, because the four-point condition is trivial. 
Thus, the dimension of the 1st secant set is (2) = (2) ^ ("2^^) ' ^ desired. 

Now suppose n > 3 and by the inductive hypothesis, let TV be an (n — 2) x (n — 2) 
matrix of tree rank at most r — 1 such that the locus of matrices with tree rank at 
most r — 1 has dimension ("2^) — ("^^'^) ^ neighborhood of N . 

Consider the caterpillar tree with leaves in the order 1, 3, 4, . . . , rt, 2, pendant 
edge length pi for leaf i, and negative internal edges (73, ... , qn-i- Thus, the first 
two rows and columns of the distance matrix Af are defined by: 

Ml, 2 = M2.I = Pl + 93 H 1- qn-l + P2, 

i-1 

= M,,i ^pi+Y,qj+ Vi for i > 2, 

n-l 

M2,i = M,,2 =Pi + ^qj+ P2 for i > 2. 

Just from the values in these two rows, we can solve for the values of the edge 
lengths: 



Pi = 


^(Ml,2 


+ Ml,3 


- M2,3) 




P2 


^(Ml,2 


+ M2,„ 


- Mi,„) 




Pi = 




+ M2,r - 


- A^l,2) 


for i > 2 


qi = 




-1 + A/2 




- M2,^) 



Therefore, the projection onto the first two columns has dimension 71 + (n — 3) = 
2n — 3 in a neighborhood of this point. 
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Assume that the pi are sufficiently negative that the lower right (71 — 2) x (n — 2) 
submatrix of Ad is less than TV in every entry. Then 



Ml 



00 
00 



00 
00 



00 
00 
N 





A/12 


A/13 ■ 


■ A/i„ 


A/12 




A/23 ■ 


■ M2n 


A/13 


A/23 




■ iVi,„_2 


A/i„ 


A/2„ 







has tree rank at most r and has a neighborhood of such matrices of dimension 



2n-3 



' n - 2 
2 

which is the desired expression. 



n — 2r 
2 



2?' 



□ 



Unlike the cases of symmetric Barvinok rank and star tree rank, we do not know 
the maximum tree rank of a ri x n dissimilarity matrix for large n. We have an 
upper bound of ri — 2 by Theorem I10[ and we can improve on this slightly: 

Theorem 18. For n > Q, a n x n dissimilarity matrix AI has tree rank at most 
n — 3. 

Proof. Let M be a 6 x 6 dissimilarity matrix. Consider the tropical polynomial 
whose terms correspond to the perfect matchings on 6 vertices: 

a;i2a;34X56 © Xi2X35X4e © X12X36X45 © Xi3a;24X56 © Xi3X25a:46 

© a;i3a;26a;45 © a;i4a;23a;56 © a;i4a;25a;36 © a;i4X26a;35 © 2:15X23X46 
© X15X24X36 © X15X26X34 © X16X23X45 © X16X24X35 © X16X25X34. 

Note that this is the tropicalization of the Pfafhan of a 6 x 6 dissimilarity matrix 
(see, for example [101 Ch. 7]). After relabeling the vertices, we can assume that 
the minimum is achieved by the term X12X34X56. In particular, this means that 
for the 4-point condition applied to the upper left 4x4 matrix, the minimum is 
achieved by X12X34. We can set Xi equal to the smaller of A/13 + A/24 — A/12 and 
A/14 + A/23 — A/12, so that the matrix 



TV 



is a tree matrix. Moreover, by our assumption, we have that Xi > A/34. By similar 
logic, there exist X2 and X3 such that the following is an expression of AI as the 
tropical sum of 3 tree matrices: 



* 


A/12 


A/13 


A/14 


00 


00 


A/12 




A/23 


A/24 


00 


00 


A/13 


A/23 




Xi 


00 


00 


A/14 


A/24 


Xi 




00 


00 


00 


00 


00 


00 


* 


00 


00 


00 


00 


00 


00 





TV© 



Therefore, AI has tree rank at most 3. 



* 


X2 


00 


00 


A/15 


A/16 






00 


00 


00 


00 


00 


X2 




00 


00 


A/25 


A/26 




00 




00 


00 


00 


00 


00 


00 


* 


00 


00 


00 




00 


00 




A/34 


A/35 


A/36 


00 


00 


00 


* 


00 


00 


© 


00 


00 


A/34 




M45 


A/46 


A/15 


A/25 


00 


00 




A/56 




00 


00 


A/35 


A/45 




X3 


A/16 


A/26 


00 


00 


A/56 


* 




00 


00 


M36 


A/46 


X3 


* 
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11 


niaximuni tree rank 


example 


3 


1 




4 


2 




5 


3 


0/1 matrix corresponding to 5-cycle 


6 


3 




7 


4 




8 


5 




9 


6 


M in © 


10 


6 or 7 


Any extension of M in © 


9k 


between 6k and 9fc — 3 


Mfc from discussion following ^ 



Table 1. Maximum possible tree rank of an n x n dissimilarity 
matrix, to the best of our knowledge. The upper bounds come from 
Theorems [To] and [TS] The examples have the largest tree ranks that 
are known to us. The omitted examples can be provided by taking 
a principal submatrix of a larger example, by Lemma [191 



For n, > 6, the theorem follows from Lemma [191 



□ 



Beginning with n = 10, we don't know whether or not the bound in Theorem 1 181 
is sharp. For the following 9x9 matrix, found by random search, the deficiency 
graph was computed to have chromatic number 6: 



(9) 



M 



1 6 
* 2 

2 * 
7 7 6 
2 9 

7 6 

5 1 

7 7 

1 1 



2 3 
9 7 
6 

3 3 



9 6 
7 1 
7 1 



Together with Theorem ll81 this computation shows that M has tree rank 6. For any 
fc > 1, we can form an 9k x 9k matrix Mk by putting M in blocks along the diagonal 
and setting all other entries to 10. The deficiency graph of Mk includes k copies 
of the deficiency graph of M, and all edges between distinct copies. Therefore, the 
chromatic number, and thus the tree rank, are at least 6k. 

On the other hand, in order to provide examples of an n x n matrix with tree 
rank n — 3 for all n < 9, we have the following lemma. 

Lemma 19. Let AI be annxn matrix. If any (n — m) x (n — to) principal submatrix 
has tree rank r, then M has tree rank at most r + m. 

Proof. Fix a decomposition of the (n — m) x (n — to) principal submatrix into r tree 
matrices. We can extend each tree matrix to an n x n tree matrix by Lemma |16I 
For each index i not in the principal submatrix, define Vi to be the vector which 
is C + Mij in the jth entry and — C in the ith entry, where C is a large real 
number. Then, the extended tree matrices, together with tt{vJ' Vi) for all i not 
in the principal submatrix, give a decomposition of M into r + m tree matrices, as 
desired. □ 
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These results on the maximum tree rank are summarized in Table [TJ 

6. Symmetric Barvinok rank for n = 3 

In this section, we explicitly describe the stratification of 3 x 3 symmetric matrices 
by symmetric Barvinok rank. By Theorem[6l the rank is at most 3, and the locus of 
rank 1 is the tropical variety defined by the 2x2 minors, so it suffices to characterize 
the matrices of rank at most 2. 

Following [7], we call a square matrix tropically singular if it lies in the tropical 
variety of the determinant. 

Proposition 20. Let M be a symmetric 3x3 matrix. Then the following are 
equivalent: 

(1) M has symmetric Barvinok rank at most 2; 

(2) The deficiency graph of M is 2-colorable; 

(3) M is tropically singular and Ala + Mjj < 2A/y for all 1 < i,j <S. 

Proof. First we note that Ma + Mjj < 2My implies that every term of the tropical 
determinant is greater than or equal to Mn + M22 + Thus, a matrix M 

satisfying these inequalities is tropically singular if and only if some other term of 
the tropical determinant equals Mu + M22 + 
Proposition [1] shows that (1) implies (2). 

If the deficiency graph is 2-colorable, then it can't have any loops, so Mu + Mjj < 
2Mij. Furthermore, the three diagonal entries can't form a clique, so without loss 
of generality, we assume that there is no edge between 11 and 22. Together with the 
inequality, this implies that A/ii+A/22 = 2^4^12, so A/n +A/22 +A/33 = 2A/i2 + Af33, 
so by our initial remark, Af is tropically singular. Therefore (2) implies (3). 

Finally, suppose that M satisfies (3). We can subtract Ma/ 2 from the ith row 
and ith column without changing the rank, and so we assume that every diagonal 
entry is 0. The inequalities then say that all of the off-diagonal entries are non- 
negative. For the minimum in the tropical determinant to be achieved at least 
twice, we must have at least one off-diagonal entry equal to 0. Without loss of 
generality, we assume that A/12 = 0. Then, 











Mi3 




"0 





00 




2A/i3 


A/13 + M23 


A/13 


M = 








M23 










00 




Ml3 + M23 


2A/23 


M23 




Mi3 


A/23 







00 


CO 


00 




A/13 


M23 






where 00 is as in Remark [T] Therefore, A/ has symmetric Barvinok rank at most 2. 

□ 



Remark 7. For larger matrices the symmetric Barvinok rank does not have as simple 
a characterization as the third condition in Proposition 1201 A necessary condition 
for a symmetric n x n matrix to have rank at most r is that Ma + Mjj < 2Mij 
and all the (r H- 1) x (r + 1) submatrices are tropically singular, but this condition 
is not sufficient. For n > 5 and r = n, there are n x n symmetric matrices with 
finite rank greater than n by Remark [2l Even for n = 4 and for r = 2 and r = 3, 
the matrix 

"0 12" 



2 10 
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has symmetric Barvinok rank 4, as the nodes 12, 13, 24, and 34 form a 4-chque 
in its deficiency graph. However, M and aU of its 3 x 3 submatrices are tropically 
singular. 

7. Star tree rank for n = 5 

In this section, we give an explicit characterization of the secant set of the space 
of star trees in the case n = 5. We do the same for the Grassmannian in the next 
section. 

From Theorem I10| we know that the maximum star tree rank of a 5 x 5 matrix 
is 3. On the other hand, the set of dissimilarity matrices of star tree rank 1 are 
defined by the 2x2 minors. Thus, our task is to describe the second secant set of 
the space of star trees, i.e. the set of dissimilarity matrices of star tree rank 2. 

First, we recall the defining ideal of the classical secant variety. The space of 
star trees is the tropicalization of the projection of the rank 1 symmetric matrices 
onto their off-diagonal entries. Its second secant variety is a hypersurface in 
defined by the following 12-term quintic polynomial, known as the pentad 15]: 

xi2Xi3X2AX-izXAb " xi2Xi^X2zX3aXaz - a;i2a;i4a;23a:^35a;45 + xi2XiAX2bX-iAXiz 
+xi2XizX2'3X:iiXiz - a;i2a;i5a::24a;34X35 + a;i3a;i4a;23a:^25a;45 - a:i3a;i4a:24a;252:35 
- a;i3a;i5a;23a;24a;45 + a;i3Xi5a;24a:^25a;34 - xiAXizX2:iX2bX'iA + a;i4a;i5a;23a:24a;35 

Note that the 12 terms of the pentad correspond to the 12 different cycles on 
5 vertices. The second secant set of the space of star trees is contained in the 
tropicalization of the pentad, but the containment is proper. Nonetheless, the 
terms of the pentad play a fundamental role in characterizing matrices of rank at 
most 2. 

Theorem 21. Let M be a 5 X 5 dissimilarity matrix. The following are equivalent: 

(1) M has star tree rank at most 2; 

(2) The deficiency graph of M is 2-colorable; 

(3) The minimum of the terms of the pentad is achieved at two terms which 
satisfy the following conditions: 

(a) The terms differ by a transposition; 

(b) Assuming, without loss of generality, that the minimized terms are 
a;i2a;23a;342;45a;i5 and xizX23X2AXi5Xiz, then we have that M14 + M23 < 

Mi2+M34 = Mi3 + A'/24- 

Proof. (1) implies (2) by Proposition [1] 

We show that (2) implies (3) by proving the contrapositive. Suppose that M 
doesn't satisfy the two conditions for any pair of terms. Without loss of generality, 
we assume that xi2X2'3X3iXAzXi5 is a minimal term from the pentad. By minimality, 
M12 + M34 is less than or equal to M13 + M24. If this inequahty is strict, then we 
have an edge between 12 and 34 in the deficiency graph. On the other hand, if it is 
an equality, then by our assumption that the conditions in (3) don't hold, M12-I-M34 
must be less than M14-I-M23, in which case we also have an edge between 12 and 34. 
Similarly, we have edges between 34 and 15, between 15 and 23, between 23 and 45, 
and between 45 and 12. Thus, the graph has a 5-cycle, and so is not 2-colorable. 
Therefore, M has star tree rank 3. 
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Finally, suppose that the two conditions in (3) hold. Let A 
Mi3 + M24. Then we claim that: 



M 



M12 + A/34 



* 


A/12 


A/13 


A ~ A/23 


00 














A/12 


=|! 


A/23 


A/24 


00 














A/13 


A/23 




A/34 


00 














A - A/23 


A/24 


A/34 




CO 














00 


00 


00 


00 
















* 


A/l4- 


f-A/25- 


-A/45 A/14- 


f-A/35- 


-A/45 




A/14 




A/15 


-A/25-A/45 








B 




A/14- 


l-A/25- 


-A/15 


A/25 


-A/35-A/45 




B 










A/14- 


f-A/35' 


-A/15 


M35 


A/14 


A/14- 


\-M25~ 


-A/15 Afi4+A/35- 


-A/15 




* 




Af45 


A/15 




M25 




A/35 






A/45 




* 



A/ir 
A/ir 



where 00 is as in Lemma Hand B = A/14 + A/25 + A/35 - A/15 - A/45. Note that 
each matrix has some entries taken from A/, while the rest are forced by the rank 1 
condition. We just need to check that the minimum of these two matrices is in 
fact M. To see this, we have the following inequalities from condition (3): 



M- 



12 



A/14 - 
- A/23 



M23 < A/12 
' A/34 + A/45 



A/ 



13 



A/ 



23 



A/,. 



A/4 



A/i 



23 



M: 



34 ■ 



A/ 



45 



A/34 ^ A/14 < A - A/23 
A/15 < A/12 + A/23 + A/35 + M45 + A/14 

=> A/34 < A/14 + A/35 - A/15 
A/15 < A/13 + A/23 + A/25 + ^/45 + A/14 

=^ A/24 < A/14 + ^/25 - ^/l5 
A/15 < A/14 + A/34 + ^/23 + ^/25 + A/15 
=^ A/12 < A/14 + A/25 - A/45 
A/13 + A^23 + M24, + A/45 + Mi5 < A/14 + A/24 + A/23 + A/35 + A/15 

=> A/13 < A/14 + A/35 - A/45 
M12 + M23 + A/34 + M45 + 7\/i5 < A/12 + ^/25 + ^/35 + A/34 + A/14 

^ M23 < B 
Therefore, M has star tree rank at most 2. 

8. Tree rank for n = 5 



□ 



We now turn our attention to tree rank of 5 x 5 dissimilarity matrices. As in 
the previous section, the maximum tree rank is 3 and so it suffices to characterize 
5x5 dissimilarity matrices of tree rank at most 2. Unlike the previous section, the 
second classical secant variety is already all of C^", so there is no classical polynomial 
whose tropicalization gives us a clue to the tropical secant set. However, the tropical 
pentad again shows up in our characterization. 

First, here is a simple example ofa5x50/l dissimilarity matrix with tree rank 
3. Consider the 0/1 matrix corresponding to the 5-cycle C5. Now, C5 cannot be 
covered by fewer than 3 fc-partitc graphs, and so the matrix has tree rank at least 
3 by Proposition [ini On the other hand, it has tree rank at most 3 by Theorem [TUl 
and the inequality in ([2]). We will see in Remark [8] that this matrix is, in a certain 
sense, the only such example. 

Let P be the tropical polynomial in variables {xij : 1 < i < j < 5} which 
is the tropical sum of the 22 tropical monomials of degree 5 in which each i G 
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{1, . . . , 5} appears in a subscript exactly twice. Thus P has 12 monomials of the 
form a;i2X23a;34X45a:i5, forming the terms of the pentad, and 10 new monomials of 
the form a;i2a;232;3ix|5. Let us call terms of the former kind pentagons, and terms 
of the latter kind triangles. 

Theorem 22. Let AI be a 5 x 5 dissimilarity matrix. Then the following are 
equivalent: 

(1) M has tree rank at most 2; 

(2) The deficiency graph is 2-colorable; 

(3) The tropical polynomial P achieves its minimum at a triangle. 

Proof. First, (1) implies (2) by Proposition [TJ 

For (2) implies (3), we prove the contrapositive. Suppose the minimal terms of 
P are all pentagons; without loss of generality, we assume that a:;i2a;232;34a;45a;i5 is a 
minimal term. Since xi4a;45Xi5a;23 is not minimal, we have Mi2 + M^4^ < Afi4 + A#23- 
Similarly, we have, 

Mi2 + M23 + M34 + M45 + A/15 < 2 A/15 + A/23 + A/34 + A/24, and 
M12 + A/23 + A/34 + A/45 + A/15 < 2A/45 + A/12 + A/23 + A/13. 

Adding these together and cancelling, we get A/12 + A/34 < A/13 + A/24. Thus, 
12 and 34 are adjacent in the deficiency graph. By similar reasoning, we have 
adjacencies 12 — 34 — 15 — 23 — 45 — 12 in the deficiency graph, so it has a five cycle 
and is not 2-colorable. 

Finally, we prove that (3) implies (1). Assume without loss of generality that 
a;34X35a;45xf2 is among the terms minimizing P. This implies that ^12X34, xi2Xj,^, 
and are each minimal terms in their respective Pliicker equations. Then we 

can use Lemmas [23] and [24] below to obtain a decomposition of M into two tree 
matrices. 

Lemma 23. For any 5x5 dissimilarity matrix M such that X12X34, X12X35, and 
X12X45 are each minimal terms in their respective Pliicker equations, there exists 
some 5x5 tree matrix T, such that for every ij G ('2'); have Tij > Mij, with 
equality tf ij G {12,13,14,15,23,24,25}. 

Proof. If M satisfies 



then adding shows that each inequality must be an equality. Thus, without loss of 
generality, we may assume that 



A/14 + A/23 < A/13 + A/24, 
A/15 + A/24 < A/14 + A/25, 
A/13 + A/25 < A/15 + A/23, 



(10) 

(11) 
(12) 



A/14 + A/23 > A/13 + A^24, 
A/15 + A/24 < A/14 + A/25, 
A/13 + A/25 < A/15 + A/23. 



Every other case is equivalent to this one via permutations of {1, 2} and {3, 4, 5}. 
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Figure 2. The two 2-colorablc possibilities for A 



Now define T as follows. Let T,j = il/y for ij G {12, 13, 14, 15, 23, 24, 25}, and 

T^i = ^24 + 2^13 ^ Ti2, 
T35 = T25 + T13 — T12, 
^45 = TiQ + T24 — T12. 

That T dominates Af in every coordinate follows from the inequalities (fTO|l . (fTTj) . 
and (|12|) . To check that T has tree rank 1, it suffices to check the four-point 
condition on each 4-tuple. This condition is satisfied for the 4-tuples {1,2,3,4}, 
{1, 2, 3, 5}, and {1, 2, 4, 5}, by choice of T34, T45. Furthermore, we claim that 

Ti5 + 734 = Ti3 + T45 < Tu + T35, and 

T24 + T35 ~ 725 + < 723 + 745 . 

These inequalities follow immediately from substituting for 734,735,745 and using 
inequalities ^ and □ 

Lemma 24. For any 5x5 dissimilarity matrix M , there exists some 5x5 tree 
matrix T' such that for every pair of indices i and j, we have T/^ > Alij , with 
equality if ij € {34,35,45}. 

Proof. The {3, 4, 5}-principal submatrix of M is a tree matrix and therefore T' 
exists with the desired properties by Lemma [TBI □ 

We can now finish the proof of Theorem [22l Let T be as given in Lemma [23l 
and let T' be as given in Lemma [24l Then M ~ T (B T' and so M has tree rank at 
most 2. □ 

In the rest of this section, we present a detailed analysis of the deficiency 
graphs Aj\/ for n ~ 5. There are 5 tropical Pliicker relations on a 5 x 5 matrix, 
each containing 3 terms. Each term is the tropical product of terms with disjoint 
entries. Thus, Ajv/ is a subgraph with at most 5 edges of the Petersen graph Pio, 
which is the graph on vertices ('2') with an edge between ij and kl if and only 
if {i,j} and {k,l} are disjoint sets. The following theorem describes the possible 
subgraphs that Aj\/ can be. 

Theorem 25. Let M be a 5 x 5 dissimilarity matrix. Then the deficiency graph 
Am is precisely one of the following: 

(1) The trivial graph, in which case M has tree rank 1. 
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(2) A non-trivial graph with fewer than 5 edges, in which case M has tree 
rank 2. 

(3) Up to relabeling, either of the two graphs in Figure\^ in which case M has 
tree rank 2. 

(4) A b-cycle, in which case M has tree rank 3. 

Proof. The matrix M is a tree matrix if and only if the four-point condition holds 
for all 4-tuples, i.e. if and only if Aa/ is trivial. This is the first case. 

Now suppose that Am is a non-trivial graph with at most 4 edges. Then, at 
least one four-point condition holds, so Lemma [T9l implies that M has tree rank at 
most 2. However, at least one four-point condition is violated, so M must have tree 
rank exactly 2. 

In the rest of the proof, we assume that Aj\f has 5 edges, i.e. that each of the 
five 4-tuples yields one deficient pair. Thus, A7\/ contains exactly one of the three 
edges 12 — 34, 13 — 24, and 14 — 23, and similarly for the remaining 4-tuples. Let 
us say that a subgraph H of Pio admits an alternating even cycle if there exists a 
cycle of even length in Pio such that, traversing the cycle, the edges are alternately 
members and nonmembcrs of H (Figure [3]). 

Lemma 26. The graph A^/ admits no alternating even cycle. 

Proof. Note that the only even cycles in the Petersen graph are of lengths 6 and 8 
(see, for example, [3j Ch. 4.2]). Suppose C is an alternating cycle of length 6. 
All 6-cycles of Pio are isomorphic, so after relabeling, we may assume that C has 
vertices 45, 13, 25, 34, 15, 23, in that order. Now suppose that 45 — 13, 25 — 34, and 
15 — 23 are the three edges of the cycle in H. Then 

M45 + ^/l3 < A/34 + Afl5, 
M25 + M34 < M23 + M45, 
Afl5 + M23<Mi3 + M25. 

But adding these inequalities yields a contradiction. If instead 13 — 25, 34 — 15, 
and 23 — 45 are the three edges of the cycle lying in H , then we obtain a similar 
contradiction. 

The case of a cycle of length 8 is analogous, so we omit it. □ 
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We now use Lemma [26] to show that, up to relabehng, the only two possibiUties 
for Aa/, assuming that it is 2-colorable, are those in Figure [21 We organize our 
case analysis according to the maximum degree in the graph Am ■ 

If Aj\/ has maximum degree 1, so that it is a perfect matching in Piq, then one 
may check that it has an alternating 8-cycle. This is impossible by Lemma [26l 

If Aj\/ has maximum degree 2, then it is cither a 4-path and a 1-path, a 3-path 
and two 1-paths, or two 2-paths and a 1-path. (By a fc-path we mean a path with 
k edges). It is easy to check that in each of these cases, Am admits an alternating 
6-cycle, which is again impossible by Lemma 1261 

Thus, Am has a vertex of degree 3, which we assume to be the vertex 12. In 
this case, one may check that, up to relabeling, Am is one of the two graphs shown 
in Figure [2] Note that in either case, the graphs are 2-colorable, and thus M has 
tree rank 2 by Theorem [22l 

Finally, if Aj\/ is not 2-colorable, then it must have an odd cycle. The Petersen 
graph has no 3-cycles, so Am must be a 5-cycle. □ 

Remark 8. If A/ is the 0/1 matrix corresponding to the 5-cycle C5, then Am is also 
a 5-cycle by Theorem 1251 Explicitly, Am has an edge for each non- adjacent pair 
of edges in the graph C5. Moreover, Theorem [25l tells us that any other matrix N 
with tree rank 3 must have the same deficiency graph (up to relabeling). In this 
sense, M is essentially the only example of a 5 x 5 dissimilarity matrix with tree 
rank 3. 

9. Open questions 

• What is the maximum tree rank of a n x n dissimilarity matrix? Theorem llSI 
gives an upper bound, but beginning with n = 10, we do not know if it is 
sharp. Specifically, does there exist a 10 x 10 dissimilarity matrix with tree 
rank 7? 

• Give a (reasonable) algorithm for computing tree rank. Note that both the 
tropical Veronese and the space of star trees are classical linear spaces, and 
so the results in [6] can be applied to compute star tree rank and symmetric 
Barvinok rank. However, it would be nice to have a good algorithm for 
computing tree rank. 

• Is it true that the rank of a matrix, according to any of our notions, is always 
equal to the chromatic number of the corresponding deficiency graph? In 
Sections [6l [3 and[8l we observed that this was true for symmetric Barvinok 
rank with n < 3, star tree rank with n < 5, and tree rank with n < 5 
respectively. In general, we believe the answer is no, but we do not know 
of a counterexample. 

• In phylogenetics, trees have all edges, including the pendant edges, labeled 
by positive weights, or, equivalently, after negating, by negative weights. 
In this way, phylogenetic trees form a subset of the tropical Grassmannian, 
and we define the 'phylogenetic tree rank to be the rank with respect to this 
subset. One can ask the same questions about phylogenetic tree rank as 
in this paper: Which matrices have finite phylogenetic tree rank? What is 
the maximum possible finite phylogenetic tree rank? What is an explicit 
characterization of the secant sets for small matrices? Work in this direction 
was begun in [5]. 

• What is an algorithm for computing phylogenetic tree rank? 
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Does a matrix have tree rank 2 if and only if all its principal 6x6 sub- 
matrices have tree rank at most 2? Pachter and Sturmfels have made the 
same conjecture for their definition of tree rank |T^ p. 124]. 
What is an explicit description of 6 x 6 matrices with tree rank at most 2, 
along the lines of Theorcm[22t' Does this help with the previous conjecture? 
There is a necessary condition coming from applying Theorem [22] to every 
5x5 minor, and another from the fact that the matrix must be in the 
tropicalization of the classical secant variety. However, these two conditions 
together may not be sufficient for a 6 x 6 matrix to have tree rank at most 2. 
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